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Introns are found in various tRNA genes in all the three kingdonns of life. Especially, 
archaeal and eukaryotic genomes are good sources of tRNA introns that are removed 
by proteinaceous splicing machinery. Most intron-containing tRNA genes both in archaea 
and eukaryotes possess an intron at a so-called canonical position, one nucleotide 3' to 
their anticodon, while recent bioinformatics have revealed unusual types of tRNA introns 
and their derivatives especially in archaeal genomes. Gain and loss of tRNA introns during 
various stages of evolution are obvious both in archaea and eukaryotes from analyses of 
comparative genomics. The splicing of tRNA molecules has been studied extensively from 
biochemical and cell biological points of view, and such analyses of eukaryotic systems 
provided interesting findings in the past years. Here, I summarize recent progresses in 
the analyses of tRNA introns and the splicing process, and try to clarify new and old 
questions to be solved in the next stages. 
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Progress in bioinformatics widens our understanding of struc- 
tural characteristics of tRNA genes (Lowe and Eddy, 1997; 
Sugahara et al, 2006, 2008; Heinemann et al., 2010; Cognat 
et al., 2013). Especially, recent powerful sequence analyses with 
the next generation sequencers accumulate an enormous amount 
of sequence information in tRNA genes through whole genome 
sequencing of non-model organisms from various evolutional 
clades and through metagenome analyses mostly of prokaryotic 
species. In these analyses, introns were found in many tRNA genes 
in genomes among all of the three kingdoms of life (Heinemann 
et al., 2010). In eubacterial genomes, and their relatives, eukary- 
otic organellar genomes, small numbers of tRNA genes harbor 
the group I intron within the anticodon region (Reinhold-Hurek 
and Shub, 1992; Haugen et al., 2005). These introns are spliced by 
a series of phosphoester transfer reactions catalyzed by intronic 
sequences, whose mechanism is somehow related to splicing of 
mRNA. On the other hand, archaeal and eukaryotic nuclear 
genomes have tRNA introns whose splicing is completely depen- 
dent on proteinaceous enzymes (Phizicky and Hopper, 2010; 
Popow et al., 2012; Hopper, 2013). In addition to normal introns, 
their variations have been found in both archaeal and eukary- 
otic genomes. Furthermore, various interesting observations have 
been reported on biochemical and cell biological aspects in pre- 
tRNA splicing machinery in recent years. In this review, I mainly 
handle issues related to these "protein-spliced" tRNA introns and 
their splicing machinery by emphasizing comparison between 
archaeal and eukaryotic systems. 

STRUCTURAL CHARACTERISTICS OF tRNA GENES 
HARBORING INTRONS 

Introns found in archaeal and eukaryotic tRNA genes are mostly 
inserted at one nucleotide 3' to the anticodon, namely posi- 
tion 37/38 in the standard nomenclature (Figure 1) whQe introns 
are also inserted into other parts of tRNA genes in minor cases 



(see below in detail). Because intron insertion at this canonical 
position disrupts the anticodon stem-loop structure, its splicing 
is indispensable for tRNA maturation. On the other hand, the 
canonical intron does not seem to interrupt the overall tRNA 
structure (Figure 1). Indeed, pre-tRNAs with the canonical intron 
were shown to maintain structures of the D and T^C arms, and 
the acceptor stem by chemical and enzymatic probing (Swerdlow 
and Guthrie, 1984; Lee and Knapp, 1985). Structural charac- 
teristics of archaeal and eukaryotic tRNA genes containing the 
canonical intron have some similarity: mostly, 5'- and 3'-splice 
sites are set in short single-stranded segments franked by double- 
stranded stretches. However, close inspection of these structures 
reveal some difference between the two groups, which is derived 
from difference in strategy of splice site recognition by splic- 
ing enzymes. For splicing of pre-tRNAs, both archaebacteria and 
eukaryotes utilize similar sets of enzymes, namely tRNA splicing 
endonuclease (Sen) (Thompson and Daniel, 1990; Trotta et al., 
1997; Li et al, 1998; Akama et al, 2000; Paushkin et al, 2004) and 
tRNA Hgase (Phizicky et al., 1986; Englert and Beier, 2005; Englert 
et al, 2010, 2011; Popow et al, 2011, 2014). Some organisms 
require additional factors for the ligation step (Culver et al, 1997; 
Harding et al, 2008; Popow et al., 2011, 2014). Among these, 
splicing endonuclease is responsible for recognition of splice sites, 
and acts as a decoding engine of tRNA-type splice sites on various 
transcripts (see below in detail). 

INTRON-CONTAINING tRNA GENES IN ARCHAEBACTERIA 

Introns are found in every isodecoder tRNA (tRNAs with the 
same anticodon) genes of sequenced archaeal genomes. On an 
average, ~15% of tRNA genes have introns in the archaeal 
genomes while ratio of intron-containing genes varies from ~8% 
in Euryarchaeota to ~48% in Crenarchaeota (Marck and Grosjean, 
2003; Sugahara et al, 2008; Chan et al, 2011). Length of introns 
ranges from 11 to 129 nt, and its median for each isodecoder 



www.frontiersln.org 



July 2014 I Volume 5 | Article 213 | 1 



Yoshihisa 



Handling tRNA introns 



pre-tRNA with a canonical Intron 

Archaea 3' Eukaryote 3' 

5' 




mature tRNA 

J, amino acid-accepting unit 

»3' 





splicing 




37/38 
decoding unit 



FIGURE 1 I Secondary structures of pre-tRNA and mature tRNA. Typical 
secondary structures of pre-tRNAs with a canonical intron (left) and mature 
tRNA (right) are schematically shown. White circles with three black 
circles, blues circles, and light gray circles represent 5'-exon, intron and 3' 
exon of pre-tRNA, respectively. The anticodon is represented with black 
circles, and CCA tri-nucleotides added at the 3'-terminus of tRNA are 



depicted with dark gray circles. The BHB motif in archaeal pre-tRNA and 
the A-l pair in eukaryotic pre-tRNA are highlighted with orange and yellow 
shadow, respectively. Arrowheads represent splice sites. Mature tRNA is 
illustrated in two different ways. In the right most structure, decoding and 
amino acid-accepting units of mature tRNA are marked with light blue and 
light green shadow, respectively. 



A Archaea B Eukaryote 




FIGURE 2 I Insertion points of non-canonical introns in tRNA genes. 

Insertion points of non-canonical introns in various tRNA genes from 
archaea (A) and from eukaryotes (B) are shown. Positions of intron 
insertion found in certain groups of organisms are color-coded as shown on 
top of the tRNA models. Intron insertion points are summarized from data 
in Sugahara et al. (2008), Sugahara et al. (2009), and Chan et al. (2011) for 
archaea, and Kawach et al. (2005), Soma et al. (2007), and Maruyama et al. 
(2010) for eukaryotes. 



tRNA falls mostly in 12-25 nt except the case of tRNA-TrpccA 
introns (65 nt), which nest a box C/D small RNA (Omer et al, 
2000; Clouet d'Orval et al, 2001; Singh et al, 2004). The hall- 
mark of splice sites in archaeal pre-tRNAs is the bulge-helix-bulge 
(BHB) motif (Kjems and Garrett, 1988; Tang et al., 2002; Marck 
and Grosjean, 2003), and the BHB motif is a critical determi- 
nant for recognition by archaeal splicing endonuclease (Xue et al, 
2006), which consists of a 4 bp double-stranded helix flanked 
with two 3 nt bulges (Figure 1, shadowed with orange). In a pre- 
tRNA, the first two nucleotides of the anticodon base-pair with 
the intron to form a part of the central 4 bp helix, and the two 
adjacent bulges provide the 5'- and 3'-splice sites. The 5'- and 
3'-splice sites may exist as independent entities, such as hBH 
and HBh' motifs, in which splice sites are flanked by two short 
helices, in certain archaeal species (see non-canonical splice sites 
in Figures) (Marck and Grosjean, 2003). This secondary struc- 
ture requirement in the BHB motif restricts a part of the primary 
sequence of the intron while quite large variations in sequence are 
accepted in the other parts of the intron. 

Because of this transplantable nature of the splice sites, introns 
are also found at non-canonical positions of tRNAs, such as 20/2 1 , 
22/23, 25/26, 29/30, 30/31, 45/46, 53/54, 56/57, 59/60, etc., in 
some archaeal genomes (Figures 2A, 3). Especially, Crenarchaeota 
and Nanoarchaeota genomes are rich sources of non-canonical 
introns. In an extreme case, intron is inserted at position 3/4 
in certain Thermoproteales tRNAs (Sugahara et al., 2008). The 
non-canonical intron is sometimes accompanied by the canon- 
ical and other non-canonical introns in the same tRNA genes. 
For example, a tRNA-ProuGG gene in Pyrobaculum islandicum has 
two non-canonical introns inserted at position 25/26 and 56/57 in 
addition to a canonical intron (Figure 3B) (Sugahara et al., 2007). 
Splice sites of these non-canonical introns are found not only in 
the BHB motif but also in the HBh' and/or h'BH motifs (Marck 
and Grosjean, 2003). 

Furthermore, tRNA genes consist of separated transcrip- 
tional units have been identified in Crenarchaeal genomes 



(Randau et al., 2005b, c). "Split tRNA genes" were first reported 
in Nanoarchaeum equitans: in this organism, tRNA-HiscuG 
is encoded by two gene fragments corresponding to 5'- and 
3'-halves separated at position 37/38 in the anticodon loop 
(Figure 3C). The 5'- and 3'-halves are transcribed from their 
own promoters with the trailer and leader sequences whose por- 
tions are complementary to each other. A predicted secondary 
structure of the hybridized fragments is highly similar to that 
of a pre-tRNA harboring a canonical intron with a relaxed BHB 
motif, like a pre-tRNA received a cleavage at a loop in the intron. 
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FIGURE 3 I Unusual tRNA genes. Various examples of unusual tRNA 
genes are represented (A-F). Exonic regions, BHB motifs and splice sites 
are marked as in Figure 1. Introns, leader and trailer sequences removed 
by splicing machinery are marked with bluish or purple-like colors. In panels 
(E,F), linker regions between permutated tRNA fragments are colored in 
green, and cleavage sites are marked with arrows. tRNA secondary 
structures are drawn according to Chan et al. (2011), Sugahara et al. (2007), 
Randau et al. (2005a), Fujishima et al. (2009), and Soma et al. (2007). 



Indeed, RT-PCR analysis revealed that the split tRNA fragments 
are transcribed, and the transcripts are joined to form mature and 
functional tRNAs in vivo, indicating that frans-splicing is oper- 
ating in N. equitans (Randau et al., 2005b, c). Further in vitro 
analyses revealed that splicing endonuclease from N. equitans 
can recognize this "pre-tRNA" complex and cleave off the trailer 
and leader sequences at the precise positions, and the resulting 
RNA is suitable for ligation by tRNA ligase (Randau et al., 2005a; 



Tocchini-Valentini et al, 2005). An interesting case in this organ- 
ism is that tRNA-Glucuc and tRNA-Gluuuc. isodecoder tRNAs 
for the same amino acid, are produced from two different 5'- 
halves and one common 3'-half by fra«s-splicing (Randau et al., 
2005b). Probably, frans-splicing contributes to saving genomic 
space to be assigned to tRNAs and to increasing probability to 
have more isodecoders especially in the case of N. equitans, a par- 
asitic bacterium with massive genome reduction (Makarova and 
Koonin, 2005). Or frans-splicing maybe an evolutionary remnant 
of the ancient form of tRNA gene organization (see below). 

There are more complicated cases. In Caldivirga maquilin- 
gensis, tRNA-Gly isodecoders with anticodons, CCC, UCC, 
and GCC, are formed by combinations of up-to three out of 
five independent transcripts through frans-splicing (Figure 3D) 
(Fujishima et al, 2009; Sugahara et al, 2009). While tRNA- 
Glyccc is made from the 5'- and 3'-halves covering 1-37 and 38- 
73 regions, respectively, like the case of N. equitans tRNA-GlucuCi 
the other two are joined from the 1-25 fragment common for the 
two, either one of the two specific fragments covering 26-37 with 
the anticodons, and the 3'-half used for all of the three tRNA- 
Gly isodecoders. Most of splits in the separated tRNA genes are 
located at canonical position (37/38) while those are at position 
29/30 of tRNA-AlacGC and tRNA-Alaucc, and at 25/26 of tRNA- 
Gluuuc in the C. maquilingensis genome. N. equitans and some 
Staphylothermus genomes harbor spHt tRNA-Lyscuu at position 
30/31 (Fujishima et al., 2009; Chan et al, 2011). For the tri-split 
tRNA genes, splits are usually combination between one canoni- 
cal position and one or more non-canonical positions (Fujishima 
etal, 2009). 

INTRON-CONTAINING tRNA GENES IN EUKARYOTES 

In eukaryotic genomes, most of intron-containing tRNA genes 
have their introns at the canonical position with length of 6- 
133 nt (Lowe and Chan, 2011). Because the nuclear genome of 
a eukaryote contains multiple tRNA genes encoding an isode- 
coder tRNA to allow sufficient supply of the tRNA, the number 
of tRNA genes in a eukaryotic genome does not correspond 
to that of sequence variations of tRNAs. Ratios of intron- 
containing tRNA genes vary significantly among eukaryotes: only 
39 out of 1068 tRNAs genes (3.7%) have an canonical intron 
in Strongylocentrotus purpuratus (sea urchin) while in the infec- 
tious yeast, Cryptococcus neoformans, 132 out of 143 tRNA genes 
(92%) have an intron (Lowe and Chan, 2011). Whether all the 
predicted introns in eukaryotic genomes are spliced properly 
has not been fully confirmed. However, the yeast, Saccharomyces 
cerevisiae, which has 61 intron-containing genes encoding 10 dif- 
ferent isodecoder tRNAs, can splice all the pre-tRNAs with introns 
by the single splicing endonuclease, the Sen complex, and this is 
also true in human (Trotta et al., 1997; Paushkin et al., 2004). 

A clear difference between eukaryotic and archaeal introns in 
tRNA genes is that eukaryotic introns do not have clear local 
motifs specifying splice sites as described on archaeal introns 
(Figure 1). Rather, eukaryotic splicing endonuclease is consid- 
ered to recognize splicing sites of pre-tRNAs by ruler-mechanism, 
in which the Sen complex measures distance between the body 
of tRNA and the splice sites, and this allows more flexible 
sequence selection around splicing sites in eukaryotic pre-tRNAs 
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(Greer et al., 1987; Reyes and Abelson, 1988). Analyses of the sec- 
ondary and tertiary structures of eukaryotic pre-tRNAs indicate 
that many pre-tRNAs have splice sites in single-stranded regions 
adjacent to double-stranded stretches, suggesting a eukaryotic 
site-recognition strategy similar to that of archaeal enzyme to 
some extent (Greer et al, 1987; Reyes and Abelson, 1988). 
Indeed, eukaryotic splicing endonuclease can recognize and splice 
archaeal-type pre-tRNAs with a BHB motif, meaning that eukary- 
otic splicing endonuclease has ability to read local structural fea- 
tures of substrate RNAs (Fabbri et al., 1998; Di Segni et al., 2005). 
However, close inspection of substrate specificity in tRNA splic- 
ing of S. cerevisiae and Xenopus extracts with artificial substrates 
demonstrated that double-stranded regions formed between the 
anticodon loop and the intronic sequence are not prerequisite 
to splicing (Reyes and Abelson, 1988). Thus, for our complete 
understanding of splice site properties essential for Sen recog- 
nition, we need to wait until structures of complexes between 
eukaryotic splicing endonuclease and various pre-tRNAs are 
solved. 

Although short insertions can be found in D or T^C arms 
of various eukaryotic tRNA genes, such insertions are mostly 
found one of many synonymous tRNA genes on a genome. Thus, 
such genes are supposed to be pseudogenes, and non-canonical 
introns in eukaryotic genomes are supposed to be rare. However, 
it was reported that nuclear remnants of enslaved algae, nucleo- 
morphs, in cryptophytes and chlorarachniophytes harbor tRNA 
genes with non-canonical introns in the D and T^C arms in 
addition to non-canonical positions in the anticodon stem-loop 
(Kawach et al., 2005; Maruyama et al., 2010). For example, the 
nucleomorph genome of Guillardia theta contains 9 tRNA genes 
(including one pseudogene) with insertions with length from 
3 to 24 nt at non-canonical positions (Figure 2B). Interestingly, 
their excision, including that of the 3 nt intron in the T*C arm 
of tRNA-CysGCA. tRNA-ValAAC> and tRNA-LeuuAA(pseudo) genes, 
was confirmed by sequencing of RT-PCR products of the algal 
RNAs (Kawach et al, 2005) while we still do not know how 
these non-canonical introns, especially extremely short ones, are 
spliced. Moreover, 4 tRNA genes, such as tRNA-PhecAA genes, 
of this organism harbor two introns, and 2 tRNA genes, elonga- 
tor tRNA-Met (tRNA-eMet) and tRNA-CySGCA> do three introns, 
like the case of Crenarchaeal tRNA genes. Similar non-canonical 
introns are also found in the nuclear genome of a red algae 
Cyanidioschyzon merolae (Soma et al., 2007). Therefore, certain 
eukaryotic splicing endonucleases have ability to recognize splice 
sites inserted at non-canonical positions. In this point of view, 
splicing machinery of C. merolae seems to have more archaea-like 
characteristics, and BHB motifs seem to be critical determinants 
of substrate selection for this splicing endonuclease (Soma et al., 
2013). 

Another unusual tRNAs that were first reported on the 
C. merolae genome are permutated tRNAs, in which normal 5'- 
and 3' -termini are bridged with a short loop, and their 3' -half is 
positioned upstream of the 5'-half (Soma et al, 2007). Six tRNAs 
such as tRNA-GlncuG in C. merolae have its 5'-leader sequence 
before position 38 and its 3'-trailer after position 37. Atypical 
5'- and 3' -termini are also introduced in the D arm of tRNA- 
LeuuAA. and the T*C arm in 4 tRNAs including tRNA-Alaucc 



(Figures 3E,F). RT-PCR analyses revealed that these permutated 
tRNA transcripts are indeed converted into mature and func- 
tional tRNAs via circular intermediates (Soma et al, 2007, 2013). 
The new 3'-trailer and 5'-leader sequences have ability to base- 
pair and produce a splice site-like structure with a BHB motif 
similar to that of archaeal split tRNA transcripts (Randau et al, 
2005b). Thus, splicing machinery is thought to cleave the exten- 
sions from the permutated pre-tRNAs and joined separated ends 
of revers-oriented exons while true 5'- and 3'-termini of the 
mature tRNAs are thought to be generated by the action of termi- 
nal processing enzymes, such as RNase P and tRNase Z. Similar 
permutated tRNA genes have been found in other single-cell 
algae, such as prasinophytes and nucleomorphs in cryptophytes 
(Maruyama et al, 2010). Permutated tRNA genes are not limited 
to eukaryotes. An archaea, Thermofilum pendens, also has permu- 
tated tRNA-TyrcuA and initiator tRNA-Met (tRNA-iMet) genes 
with new 5'- and 3'-termini formed in the T*C arm (Chan et al, 
201 1). However, distribution of permutated tRNA genes is so far 
limited in only certain clades of eukaryotes and archaea. 

SPLICING MACHINERY 
SPLICING ENDONUCLEASE 

As mentioned previously, splicing endonuclease is a search engine 
for tRNA-type splices sites among various transcripts produced 
by an organism. One splicing endonuclease, or an endonuclease 
complex, in an organism cleaves both the 5'- and 3'-splice sites 
of an intron to produce two 5'-OH termini on the intron and 
3'-exon, and two 2', 3'-cyclic phosphate termini on the 5'-exon 
and intron, meaning that the reaction is phosphoester transfer 
but not hydrolysis. The archaeal endonucleases are classified into 
three types from subunit organization; a^, ai and ai^i (Tocchini- 
Valentini et al., 2005; Calvin and Li, 2008). The archetypal con- 
figuration of the enzymes is supposed as an a4 homotetramer. 
The crystallographic analysis of EndA from Methanococcus jan- 
naschii revealed that this tetrameric enzyme consists of a 2-fold 
but not 4-fold symmetric structure, where two subunits are 
mainly used to build two reaction centers for splice site cleav- 
age, and the other two are used as structural components to 
position the catalytic subunits in an appropriate spatial arrange- 
ment for accepting the BHB motif (Lykke- Andersen and Garrett, 
1997; Li et al., 1998). The a2 type is a kind of "a dimer of 
dimers," in which two tandemly duplicated endonuclease units 
in a polypeptide act as a catalytic unit and a structural unit, 
respectively (Kleman-Leyer et al., 1997; Li and Abelson, 2000). 
The resulting dimer shows a configuration similar to that of 
a4 enzymes. The a2P2 type is also a derivative of a4 (Mitchell 
et al, 2009; Yoshinari et al, 2009). Gene duplication and dif- 
ferent requirements for catalytic and structural subunits have 
led sequence divergence of the two subunits. Appearance of cer- 
tain subunit types is well correlated with phylogenetic relation 
of archaebacteria and, more importantly, with diversity of tRNA 
introns and splice sites (Tocchini-Valentini et al., 2005; Calvin 
and Li, 2008). Species in Euryarchaeota and Korarchaeota essen- 
tially have a^- or a2-type splicing endonucleases, while those in 
Nanoarchaeota, Thaumarchaeota, and Crenarchaeota have a2?>2- 
type enzymes. Interestingly, the latter group of archaea is rich 
in tRNA genes with introns: most unusual tRNA genes, such as 
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those with non-canonical introns, with multiple introns or with 
permutated configuration, are found in these organisms (Marck 
and Grosjean, 2003; Sugahara et al, 2008; Chan et al, 2011). 
Some of the splice sites in these tRNA genes comprise expanded 
formats of the BHB, such as hBH or HBh' (Tocchini-Valentini 
et al., 2005). On the other hand, Euryarchaeota, mostly possessing 
a4- or a2-type enzymes, has low ratios of tRNA genes with introns 
(less than 10%), and these tRNA genes have only one intron with 
the BHB motif at the canonical position. Therefore, subunit orga- 
nization of splicing endonuclease seems to be a determinant for 
the substrate spectrum of the enzyme; strict to the BHB motif at 
the canonical position in 04 and a2 enzymes, and lenient in ai^z 
enzymes. 

Eukaryotic splicing endonuclease consisting of four subunits, 
namely Sen2, Sen34, Sen54, and Senl5, was first identified in 
the yeast S. cerevisiae (Rauhut et al., 1990; Trotta et al., 1997). 
Among the four subunits, Sen2 and Sen34 have catalytic centers 
that are responsible for cleavage of 5'- and 3'-splice sites, respec- 
tively. On the other hand, Sen54 was reported to interact with the 
D-arm of substrate pre-tRNAs and to position catalytic subunits 
away from the body of tRNA with appropriate distances (Trotta 
et al, 2006; Xue et al., 2006). Human endonuclease contains 
homologs of all the 4 Sen proteins found in S. cerevisiae (Paushkin 
et al, 2004), and Arabidopsis thaliana also possesses Sen2 and 
Sen34 homologs (Akama et al., 2000). There is appreciable con- 
servation among these eukaryotic Sen2 and Sen34 subunits, and 
archaeal endonucleases, indicating the same evolutional origin of 
splicing machinery. The human Sen complex also contains Clpl 
(hClpl) (Paushkin et al., 2004; Weitzer and Martinez, 2007). Clpl 



was first identified as a component of the cleavage factor II for 
polyadenylation of pre-mRNAs (de Vries et al., 2000), and then 
re-identified as an in vitro kinase for tRNA exons when searching 
for an enzyme phosphorylating siRNAs displaying a 5'-OH group 
(Figure 4) (Weitzer and Martinez, 2007). Divergence of sub- 
unit organization is greater in eukaryotic endonuclease than in 
archaeal enzymes, and eukaryotic splicing endonucleases have the 
ability to cleave both eukaryotic and archaeal types of splice sites 
as described above. Thus, it can be said that subunit organiza- 
tion diversity is, again, correlated with local splice site variability 
even between different kingdoms of life. Although direct relation 
between repertoires of intron-containing tRNA genes and subunit 
configuration of splicing endonucleases among eukaryotes has 
not been analyzed extensively, one interesting example is C. mero- 
lae. Bioinformatic analysis successfully identified genes for Sen2, 
Sen34, and Sen54 but failed to identify any gene or gene fragment 
that may encodes the fourth essential subunit SenlS (Soma et al., 
2013). In addition, yeast two-hybrid analyses could not identify 
other proteins that interact with either one of the three C. mero- 
lae Sen proteins, suggesting that the Sen complex of this organism 
may consist of 3 but not 4 subunits. Although further study is 
needed, such unique subunit organization in C. merolae may be 
related to existence of tRNA genes with non-canonical introns 
and permutated tRNA genes in this organism. 

ENZYMES CATALYZING LIGATION STEPS 

Excised tRNA exons are joined by tRNA ligase. There are two 
completely different chemical pathways of ligation classified from 
the origin of the phosphate bridging the 5'- and 3'-exons: 



5'-phosphate ligation patliway 




? HSPC117>Archease>DDX1 HSPC117 

RtcB RtcB'Archease 

3'-piiosphate ligation pathway 



FIGURE 4 I tRNA splice site cleavage and two pathways of tRNA 
ligation. Series of cliemical reactions in splice site cleavage of 
pre-tRNA (left), the 5'-phosphate ligation pathway (upper), and the 
3'-phosphate ligation pathway (lower) are shown schematically. Proteins 
involved in individual reactions are also shown, and unidentified factors 
are denoted by question marks. Those proteins of yeast, mammal and 
prokaryotes are color-coded with black, blue and orange, respectively. In 
the 5'-phosphate ligation pathway, yeast Tril uses GTP as a phosphate 
donor while hClpl in the mammalian system uses ATP instead of GTP 



for 5'-phosphorylation of the 3'-exon. Although a mammalian enzyme(s) 
catalyzing the last two steps of this pathway is still missing, such an 
enzyme was found in lancelet. In the 3'-phosphate ligation pathway, the 
mammalian protein responsible for 2'-3' cyclic phosphodiesterase in the 
HSPC117 complex was not fully identified, but eubacterial RtcB, an 
HSPC117 homolog, was demonstrated to possess this activity (Tanaka 
and Shuman, 2011). The starting material, pre-tRNA, and the end 
product, mature tRNA, are shadowed. Appr>Pi represents ADP-ribose 
1"-2" cyclic phosphate. 
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one is the 5' -phosphate ligation pathway and the other is the 
3'-phosphate ligation pathway (Figure 4) (Popow et al, 2012). 
At the first step, ligation systems need to open the 2', 3'-cydic 
phosphodiester on the 5'-exon. The 5'-phosphate ligation path- 
way produces the 3' terminus with 2'-phosphate and 3'-OH 
(Phizicky et al., 1986; Sawaya et al., 2003; Wang and Shuman, 
2005) while the 3'-phosphate ligation pathway does that with 
2'-OH and 3'-phosphate (Popow et al, 2011; Chakravarty et al, 
2012). The former does not use the 2' -phosphate but a new phos- 
phate derived from a nucleotide triphosphate to form a bridge 
between the two exons (Westaway et al, 1993), while the lat- 
ter utilizes the 3'-phosphate left on the 5'-exon (Popow et al., 
2011; Chakravarty et al, 2012). In both cases, the phosphorylated 
exon is further activated by nucleotidylation, and then ligated to 
its counterpart using the nucleotide monophosphate as a leaving 
group (Phizicky et al., 1986; Westaway et al., 1993; Popow et al, 
2011; Chakravarty et al, 2012). In the case of the 5'-phosphate 
ligation pathway, the 2'-phosphate left at the splice junction must 
be removed after the ligation (Culver et al, 1997; Harding et al., 
2008). 

5' -phosphate ligase 

The 5' -phosphate ligation pathway has been found widely in 
fungi and plants. It also operates in lancelet but is most proba- 
bly absent in mammalian cells (Lappe-Siefke et al, 2003; Harding 
et al, 2008; Englert et al, 2010; Popow et al., 2011). tRNA 
ligase for 5' -phosphate ligation, Trll/Rlgl, was first identified 
in S. cerevisiae and then in A. thaliana through biochemical 
analysis (Phizicky et al., 1986; Englert and Beier, 2005). Trll 
homologs are widely found in the genomes from fungi through 
diatoms to angiosperms but are absent in animals (Englert and 
Beier, 2005; Wang et al., 2006). Trll homologs catalyze all the 
chemical reactions of exon ligation except for removal of the 
2'-phosphate at the splice junction, and consist of three domains 
covering three enzymatic activities; adenylate synthetase/RNA 
ligase (ASTase), polynucleotide kinase (PNKase), and 2', 3'- 
cyclic phosphodiesterase (CPDase) in this order (Xu et al., 1990; 
Sawaya et al., 2003). CPDase opens the 2', 3'-cyclic phosphate on 
the 3'-terminus of the 5'-exon, and PNKase phosphorylates the 
5'-terminus of the 3'-exon with GTP. By transferring an AMP 
moiety to this 5' -terminus from ATP, ASTase activates this end 
and allows ligation between the two exons (Figure 4, upper). 
During this terminal activation, the AMP moiety is first cova- 
lently attached to the ASTase domain and then transferred to 
the substrate. Although splicing endonuclease is responsible for 
primary recognition of splice sites, yeast Trll has the ability to 
interact with introns in pre-tRNAs (Tanner et al., 1988) and 
with an intron-containing precursor of its non-tRNA substrate, 
HACl mRNA acting as a key component for the unfolded pro- 
tein response (Sidrauski et al., 1996; Mori et al, 2010). The 
2'-phosphate left at the splice junction is finally removed by 
2'-phosphotransferase, or Tptl in S. cerevisiae (Culver et al, 
1997). This reaction is unique in that 2'-phosphate is trans- 
ferred to NADH to yield ADP-ribose l"-2" cycHc phosphate and 
nicotinamide (Figure 4, right most part). Indeed, this enzyme is 
essential for yeast growth, indicating that the 2' -phosphate is toxic 
for protein translation. 



Although the 5' -phosphate ligation activity was once demon- 
strated in the nuclear extract of HeLa cells (Zillmann et al, 
1991), a vertebrate 5'-phosphate ligase has not yet been iden- 
tified. A lancelet Branchiostoma floridae has a complete set of 
enzymes for this ligation pathway; adenylate synthetase/RNA 
ligase activities and the other two activities for the tRNA lig- 
ation are encoded by two independent genes (Englert et al., 

2010) . In vertebrates, several polypeptides in different complexes 
can catalyze some of the individual reaction steps responsible 
for a putative 5'-phosphate ligation. First, in vitro phosphory- 
lation of the 5'-terminus of the 3'-exon can be done by hClpl 
kinase associated with the human Sen complex (Weitzer and 
Martinez, 2007). Second, mammalian genomes harbor genes for 
2H CPDase, which can open the 2', 3'-cyclic phosphodiester 
bond at the 3'-terminus of the 5'-exon to produce 3'-OH and 
2'-phosphate groups, and the CPDase domain of fungal and 
plant Trll is classified as this family (Popow et al., 2012). Third, 
mammals possess 2' -phosphotransferase activities for RNA sub- 
strates. However, both mouse 2H CPDase and Tptl (Cnpl and 
Trptl, respectively) are non-essential for viability (Lappe-Siefke 
et al., 2003; Harding et al, 2008). Especially, Trptl -knock- 
out mice, which completely lacks 2' -phosphotransferase activity, 
were demonstrated not only to translate Tyr-rich proteins effi- 
ciently, which require appropriate splicing of the tRNA-TyrcuA 
intron but also to show no defects in splicing of XBPl mRNA 
(Harding et al, 2008), which undergoes tRNA-type unconven- 
tional splicing in the cytoplasm upon unfolded protein response, 
like yeast HACl mRNA (Yoshida et al, 2001; Calfon et al, 
2002). Therefore, even if vertebrates have the complete set of 
the enzymes for the 5' -phosphate ligation pathway, this path- 
way seems to contribute to only a small, if any, part of RNA 
splicing. 

3' -phosphate ligase 

In vertebrates, a tRNA ligase activity utilizing the 2', 3'-cyclic 
phosphate to bridge the tRNA exons has been known for decades, 
but the enzyme responsible for this reaction was found only a few 
years ago (Filipowicz and Shatkin, 1983; Laski et al., 1983; Popow 
et al., 2011). Indeed, the 3'-phosphate ligase HSPC117 is a main 
player for tRNA splicing in mammals, and HSPC117 homologs 
are found in vertebrates, lancelets, insects, protozoa, algae etc., 
but not in ftingi or angiosperms, which possess Trll homologs 
(Popow et al, 201 1, 2012 for a review). Although HSPCl 17 forms 
a large complex with DDXl (RNA helicase), CGI-99, FAM98B, 
and ASW, whether all of these subunits are involved in tRNA 
ligation is not known completely (see below). Homologs of 
HSPCl 17, whose name in prokaryotes is RtcB, exist widely in 
archaeal genomes where no Trll homologs exist (Englert et al., 

2011) . Interestingly, RtcB genes are also found in eubacterial 
genomes despite the fact that eubacteria do not have tRNA genes 
with introns removed by protein-assisted splicing. And eubacte- 
rial RtcB genes are located in operons with genes encoding RNA 
3'-phosphate cyclase, which transforms a 3'-phosphate of an RNA 
molecule into a 2', 3'-cyclic phosphate (Tanaka and Shuman, 
20 1 1 ). It is suggested that the set of enzymes is used to repair dam- 
aged tRNAs whose anticodon loop is endonucleolyticaUy cleaved. 
These facts may indicate that the 3'-phosphate ligation pathway 
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is a predominant and probably primordial pathway for tRNA 
ligation. 

As mentioned above, HSPC117/RtcB also has adopted 
nucleotidylation to activate an exon terminus, but the ligase 
transfers a GMP moiety, instead of AMP, to the 3' -terminus of 
the 5' exon (Figure 4, lower) (Chakravarty et al, 2012; Popow 
et al, 2014). During the reaction, the GMP moiety is covalently 
attached to the ligase, and this activation step seems to be rate lim- 
iting to the overall ligase cycle in eubacterial RtcB (Chakravarty 
et al, 2012). This is also true in human ligase HSPC117, and 
these facts suggest that some factor(s) is required for efficient 
turnover of 3'-phosphate ligase. Bioinformatic search for clus- 
ters of eukaryotic orthologous groups conserved in the same 
model organisms as RtcB and biochemical confirmation revealed 
that a small protein named "archease" enables HSPC117 to cat- 
alyze multiple-rounds of ligation reaction (Popow et al., 2014). 
Detailed biochemical analyses revealed that archease specifically 
enhances GMP transfer from GTP to HSPC117 protein but not 
that from the HSPC117-GMP adduct to the 3'-terminus of RNA 
substrates. Furthermore, this step also requires another activ- 
ity catalyzed by DDXl, an RNA helicase within the HSPC117 
complex. The implication of DDXl ATPase provides evidence 
for molecular mechanism of ATP hydrolysis in the mammalian 
ligation reaction (Popow et al., 201 1, 2014). Interestingly, involve- 
ment of archease in tRNA processing was first reported on m^C 
formation of tRNA in archaea Pyrococcus abyssi, where archease 
enhances substrate specificity of m^C methyltransferase and pre- 
vents it from aggregation probably by acting as a chaperone 
(Auxilien et al., 2007). Recently, archease was also demonstrated 
to catalyze efficient activation of archaeal RtcB where it enhances 
all nucleotidyl transfer steps in the 3' -phosphate ligation, includ- 
ing GMP transfer to RNA molecules (Desai et al, 2014). 

DIVERSITY OF SPLICING MACHINERY IN EUKARYOTES; 
INTRACELLULAR LOCALIZATION AND FUNCTION 

As mentioned above, eukaryotic splicing factors are more com- 
plex than those of archaea. According to this complexity, the 
splicing factors in various eukaryotes have acquired diversity in 
several aspects. One interesting aspect is the place of splicing 
in eukaryotic cells. Despite the functional and structural con- 
servation of eukaryotic splicing endonucleases, their intracellular 
localization is divergent among organisms. Splicing endonucle- 
ases in vertebrates show expected localization: they are mainly 
found in the nucleus (De Robertis et al., 1981; Paushkin et al., 
2004). Thus, it has been suggested that pre-tRNAs are spliced 
in the nucleus and only matured tRNAs are supplied to the 
cytoplasm. On the other hand, in S. cerevisiae, splicing of pre- 
tRNAs proceeds in the cytoplasm: tRNA splicing endonucle- 
ase was found to associate with mitochondrial surface, and 
this association is required for proper function of the enzyme 
(Yoshihisa et al., 2003, 2007). Later, the other enzymes required 
for completing splicing were also demonstrated to localize in 
the cytoplasm (Mori et al, 2010). It is to be noted that cyto- 
plasmic localization of the Sen complex itself is not essential 
for pre-tRNA splicing for the yeast. When all the Sen sub- 
units are expressed in the nucleus, pre-tRNAs are spliced nor- 
mally (Dhungel and Hopper, 2012). These findings suggest that 



eukaryotic cells can tolerate drastic alteration of the place of tRNA 
splicing. 

Although we have not obtained a complete view of tRNA 
splicing machinery in plant cells, some subunits of Sen proteins 
and tRNA ligase from A. thaliana were revealed to have multi- 
ple destination signals (Englert et al., 2007). When expressed as 
GFP fusions, AtSen2 and AtSenl localized to the nucleus and 
mitochondria while AtTrll did to the nucleus and chloroplasts. 
Especially in the case of AtTrll, alternative translational initiation 
may produce two isoforms; one from the most up-stream AUG 
is targeted to chloroplasts and the other from downstream AUG 
is to the nucleus (Englert et al, 2007). Thus, the nucleus may be 
the primary site of pre-tRNA splicing of nuclear encoded tRNAs 
in plant cells. On the other hand, one report provided a piece of 
evidence against this notion: unspliced pre-tRNAs accumulated 
when tRNA export from the nucleus was suppressed by RNAi- 
mediated knock-down of the tRNA export carrier, Exportin- 
t/Paused (Park et al., 2005). This situation is quite similar to that 
of S. cerevisiae, where disruption of LOSl, which encodes the 
yeast homolog of Exportin-t, leads accumulation of pre-tRNAs 
because of their sequestration from cytoplasmic splicing enzymes 
(Sharma et al, 1996; Yoshihisa et al, 2003). What is the role 
of mitochondrial or chloroplastic splicing enzymes in plants? 
Since there are no tRNA genes harboring eukaryote-archaea type 
introns in mitochondria and chloroplasts, plant splicing enzymes 
in these organelles should process a different type(s) of RNAs. 

Evolution of eukaryotes also has hooked extra functions up to 
splicing machinery. As mentioned above, the yeast Sen complex 
can be transplanted from mitochondria to the nucleus without 
major defects in tRNA splicing. However, partial deletion of the 
mitochondrial targeting signal in Sen54 leads temperature sen- 
sitivity in yeast growth, and transplantation of the whole Sen 
complex to the nucleus compromises growth and rRNA mat- 
uration (Yoshihisa et al., 2003; Dhungel and Hopper, 2012). 
Thus, the yeast splicing endonuclease should have another essen- 
tial function in the cytoplasm. Human splicing endonuclease 
activity also may have an extra function(s) other than pre- 
tRNA splicing. Mutations in human SEN2, SEN34, and SEN54 
{TSEN2, TSEN34, and TSEN54 respectively) lead to pontocere- 
bellar hypoplasia, which causes specific neurodegenerative dis- 
orders including hypoplasia of the cerebellum and the ventral 
pons, despite the fact that tRNA splicing is essential for every cell 
in the human body (Budde et al, 2008; Namavar et al, 2011). 
Similar phenotypes were observed when a SEN54 homolog was 
knocked down in zebrafish (Kasher et al., 2011). Interestingly, 
this disease is also caused by mutations of mitochondrial arginyl- 
tRNA synthetase (Edvardson et al, 2007; Namavar et al, 2011). 
Although targets of human Tsen in the pontocerebellar cells 
related to this disease have not been identified so far, these 
facts suggest that Tsen has some neuronal cell specific func- 
tion, other than usual tRNA splicing, through collaboration with 
mitochondrial arginyl-tRNA synthetase, another tRNA-related 
factor. Probably, because of their relaxed substrate recognition 
and expansion of repertoires of transcripts in eukaryotic cells, 
the Sen complexes from various eukaryotes may have adopted 
abilities to process and/or degrade various substrates in addi- 
tion to intron-containing pre-tRNAs. Mutations in vertebrate 
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Clpl were also found to cause neurodegenerative phenotypes, 
such as motor-sensory defects, cortical dysgenesis and micro- 
cephaly, like the case of Tsen mutants (Hanada et al., 2013; 
Karaca et al, 2014; Schaffer et al., 2014). Mouse Clpl-K127A 
(kinase-dead) and hClpl-R140H (disease-related) compromise 
both pre-tRNA cleavage and 5'-RNA kinase activities in vitro 
while they specifically affect neuronal cells in vivo in mouse and 
human, respectively (Hanada et al., 2013; Karaca et al, 2014). 
Similar phenotypes were seen in CLPl mutants of zebrafish 
(Schaffer et al, 2014). The above facts suggest that, in vivo, 
residual activities of these Clpl mutant proteins cannot com- 
plete tasks required in the neuronal cells while they can provide 
enough tRNA splicing ability in other cell types. Although fun- 
damental functions of mammalian Clpl, as a sub unit of the Sen 
complex or as a component of Cleavage Factor II generating 
mRNA 3'-ends, are carried out in the nucleus, some reports pre- 
dict an extra-function in the cytoplasm. As mentioned before, 
hClpl also acts as a 5' -kinase for siRNAs (Weitzer and Martinez, 
2007). Phosphorylation of artificial siRNAs supplied from the 
outside of the cells may occur in the cytoplasm because only 
phosphorylated double stranded siRNAs are loaded to Ago pro- 
teins to yield RNA-induced silencing complexes in the cytoplasm 
while double stranded RNA import across the nuclear enve- 
lope has not been known (Nykanen et al, 2001; Ameres and 
Zamore, 2013). Indeed, Tsen-hClpl can be purified from the 
cytosolic fraction (Weitzer and Martinez, 2007; Karaca et al., 
2014). 

High-order functions are also postulated for mammalian 
3'-phosphate ligase. The HSPC117 complex was independently 
isolated as one of the components that allow axonal trans- 
port of mRNAs in neurons, suggesting that the HSPC117 has 
another function in the cytoplasm (Kanai et al., 2004). Indeed, 
the HSPC117 complex was recently found to shuttle between 
the nucleus and the cytoplasm (Perez-Gonzalez et al, 2014). A 
versatile RNA helicase DDXl in this complex may account for 
such functions related to mRNA dynamics. As discussed above, 
both 5' -phosphate and 3' -phosphate tRNA ligases may also act as 
healing-sealing enzymes for damaged tRNAs. 

PHYSIOLOGICAL MEANINGS TO HAVE AN INTRON IN A 
tRNA GENE 

PHYSIOLOGICAL FUNCTION OF INTRON IN PRE-tRNA DURING ITS 
MATURATION 

The tRNA intron is indeed an obstacle for tRNA's function, and 
seems to exist just to be removed after transcription. Why do 
organisms need to keep the intron on their tRNA genes? What 
are roles of the introns? So far, there are no clear answers to 
these questions. However, there are several examples that indi- 
cate some specific effects of tRNA introns on wide variety of 
aspects in the life of tRNAs and in the life of archaebacteria and 
eukaryotes. 

First, involvement of tRNA introns as recognition motifs 
for tRNA modification enzymes has been known in eukaryotes 
(Gros)ean et al., 1997). Mainly from in vitro analyses, it was 
found that certain modifications in the anticodon loop are strictly 
applied on intron-containing pre-tRNAs. One famous example 
is that pseudouridylation of the 1st and 3rd U of the anticodon 



(U34 and U36) of tRNA-UeuAU in S. cerevisiae (Szweykowska- 
Kulinska et al, 1994; Simos et al, 1996; Motorin et al, 1998). 
tRNA-IleuAU needs to distinguish the AUA codon for He from 
AUG for Met. Conversion of UAU into *A* is thought to 
enhance selectability to A against G at the wobble position. 
This pseudouridylation is applied only to intron-containing pre- 
tRNA-IleuAU by pseudouridine synthase Pusl in vitro (Figure 5A, 
upper) (Simos et al., 1996; Motorin et al., 1998). In. S. cerevisiae, 
Pusl localizes in the nucleus, and the splicing machinery does in 
the cytoplasm. Thus, the order of these processing events is guar- 
anteed by the difference in intracellular localization (Figure 5A, 
lower). Interestingly, Pusl also pseudouridylates at positions 25 
and 67 of both spliced and unspliced forms of tRNA-IleuAU- 
A similar example of intron-dependent modification is 2'-0- 
methylation of C34 in tRNA-LeucAA by Trm4 (Grosjean et al, 
1997). Both yeast and human Trm4 methyltransferases require 
the intron in the substrate, tRNA-LeucAA; in vitro (Strobel and 
Abelson, 1986; Brzezicha et al., 2006). The intron is also used 
to avoid premature modification during the series of modifi- 
cation reactions. tRNA-PhecAA) derived from intron-containing 
genes, has an unusual nucleotide wybutosine (yW) at position 
37 (Blobstein et al., 1975). The formation of yW starts with 
methylation of G37 by Trm5 in S. cerevisiae. Trm5 only recog- 
nizes the sliced but not intron-containing form of tRNA-PhecAA 
(Figure 5B, upper) (Noma et al., 2006). Again, the spliced form 
of tRNA-PhecAA first appears in the yeast cytoplasm because of 
the cytoplasmic localization of the splicing machinery. However, 
the spliced intermediate must be re-imported into the nucleus 
for this methylation since Trm5 localizes in the nucleus (Ohira 
and Suzuki, 2011). It has been shown that various tRNA species 
constantly shuttle between the cytoplasm and the nucleus, and 
the import system can deliver the spliced intermediate to the 
nucleus (Shaheen and Hopper, 2005; Takano et al., 2005; Shaheen 
et al., 2007; see also review, Hopper, 2013). After methylation 
by Trm5, tRNA-PhecAA with m'G37 is transported back to the 
cytoplasm to receive a series of chemical reactions to build yW 
at position 37 (Figure 5B, lower) (Ohira and Suzuki, 2011). In 
this case, the tRNA intron acts as a determinant of timing of 
modification. 

In archaea, there is a unique example in which tRNA intron 
itself encodes a functional unit for modification of the body of 
the tRNA. In archaeal tRNA-TrpccA precursor, a box C/D small 
RNA is nested in the intron, and this intronic part of pre-tRNA- 
TrpccA or the excised intron is used to select nucleotides C34 and 
U39 for 2'-0-methylation in trans (Figure 5C) (Omer et al., 2000; 
Clouet d'Orval et al, 2001; Singh et al, 2004). Although these 
modifications in eukaryotes and archaea seem to be the driving 
force to maintain the introns in tRNA genes, the fact that Pusl 
and Trm4 are dispensable for viability of S. cerevisiae is against 
this assumption (Simos et al., 1996; Motorin and Grosjean, 1999). 
It was also demonstrated that the intron of tRNA-TrpccA can 
be removed from the genome of Haloferax volcanii, suggesting 
that modification assisted by the intronic sequence itself does not 
cause strong selective pressure for intron maintenance (Joardar 
et al., 2008). Thus, no essential modifications only applied to 
intron-containing pre-tRNAs have been identified in eukaryotes 
and archaea so far. 
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FIGURE 5 I tRNA introns affect various biological activities as RNA and 
DNA units. Several examples where tRNA introns affect various biological 
activities as RNA units (upper) and as DNA units (lower) are illustrated. (A) 
The intron of tRNA-lleuAU in S. cerevisiae is an essential recognition motif for 
Pusl. Thus, pseudouridylation of U34 and U36 in the anticodon by Pusi is 
only applied to the intron-containing pre-tRNA. Pre-tRNA-lleuAu 
pseudouridylated by Pusl in the nucleus is subjected to cytoplasmic splicing 
after export from the nucleus in the yeast. (B) Yeast Trm5, a nuclear 
methyltransferase catalyzing the initial step of yW formation, only recognizes 
a spliced form of tRNA-PhecAA. which is produced in the cytoplasm and 
re-imported into the nucleus. Thus, yW formation starts at a late stage of 



tRNA-PhOQAA maturation. (C) The intron of tRNA-TrpccA in H. volcanii is used 
as a guide RNA to select two nucleotides, C34 and U39, of the same tRNA 
for 2'-0-methylation. (D) Sulfolobus SSVs use a part of tRNAs as an attB site 
for its integration into the host genome. Some Sulfolobus species harbor an 
intron in such tRNA genes to disrupt attB sequences. This may be a bacterial 
strategy to avoid viral infection. (E) tRNA-ThrAGu "sar ths HMR-E locus, 
which is in a heterochromatic region, acts as an insulator and prevents 
expansion of the heterochromatic region over the tRNA gene by recruiting 
TFIIIC properly in S. cerevisiae. The intron-containing tRNA-LeucAA gene 
cannot replace tRNA-Thr^Gu while the intronless version of the same tRNA 
can. See the text for details. 



POSSIBLE FUNCTIONS OF INTRON IN tRNA GENES ON THE 
CHROMOSOME 

The intron may affect functions or abilities of tRNA genes on 
chromosomes. Ahhough the primary function of tRNA genes 
is the source of genetic information to produce tRNAs, their 
well-structured and conserved sequences are utilized as targets of 
integration of viral genomes and other mobile genetic elements. 
Randau and Soil proposed that intron insertion will prevent 
tRNA genes from viral genome integration (Randau and Soil, 
2008). Indeed, many archaeal viruses have been isolated from 
extreme environments, which tRNA-intron-rich archaea also pre- 
fer (Rice et al., 2001; Prangishvili, 2013). In well-studied cases in 
archaea, such viruses chose tRNA genes as integration sites. For 
example, Sulfolobus spindle-shaped viruses (SSVs) belonging to 
the Fuselloviridae family use either tRNA-LeuGAG> tRNA-AspcuC) 
tRNA-GluuuC; or tRNA-Glucuc as their attB sites, and they inte- 
grate their genome into the anticodon loop and T*C arm of these 
tRNA genes (Wiedenheft et al., 2004). By possessing an intron 
at or near the canonical position, or dividing tRNA genes into 



halves, such tRNA genes may escape from integration of these 
viruses (Figure 5D). However, there are other tRNA genes can be 
used as attB sites for these viruses, so that avoidance of viral infec- 
tion may not be the sole reason to gain and/or maintain tRNA 
introns on archaeal genomes. 

In eukaryotes, promoter elements are embedded in the coding 
region of tRNA genes, namely the A box and B box (Figure 5E) 
(Galli et al., 1981; Geiduschek and Kassavetis, 2001). These ele- 
ments must be recognized by general transcription factor com- 
plexes in order for RNA polymerase III, especially TFIIIC, to 
initiate efficient transcription. Since the distance between the A 
and B boxes is altered by insertion of the intron, introns may affect 
recognition of the promoter elements by TFIIIC. Indeed, some 
early reports demonstrated that intron removal from reporter 
tRNA genes affects expression levels of some tRNAs in Xenopus 
oocytes in vivo and in yeast extracts in vitro, supporting the above 
assertion (Ciliberto et al., 1982; Fabrizio et al., 1987). On the 
other hand, other reports argue that yeast TFIIIC can bind to 
the tRNA promoter elements with various distance through its 
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flexible linker between two recognition domains (Schultz et al., 
1989; Gamier et al., 1990). Recent genome-wide study of TFIII 
occupancy using ChlP/Seq analyses also revealed that rather even 
occupancy of Tfcl (TFIIIC subunit) and Brfl (TFIIIB subunit) 
on various tRNA genes (Nagarajavel et al., 2013). Therefore, alter- 
ation of distance between the A and B boxes by intron insertion 
per se may cause only minor effects on transcription of tRNA 
genes in general. 

On the other hand, there is one example to indicate the effect 
of intron insertion on TFIIIC recognition of the tRNA promoter 
elements. It was reported that tRNA genes act as insulators or 
nucleosome phasing barriers (Donze et al., 1999). For this unique 
activity, the promoter elements of the tRNA genes must be prop- 
erly occupied by TFIIIC (Simms et al, 2008). A tRNA-ThrAcu 
gene, which does not have an intron, next to the yeast HMR-E 
locus acts as an insulator to block propagation of heterochro- 
matin over this gene. On the other hand, a tRNA-LeucAA gene, 
which has a 19nt-long intron, cannot act as an insulator, and 
removal of the intron brings insulator activity to the tRNA gene 
(Figure 5E) (Donze and Kamakaka, 2001). Therefore, possession 
of the intron can affect the additional function of tRNA genes 
related to genome organization. 

DIRECT EXAMINATION OF INTRON REQUIREMENT 

Essentiality of keeping introns in tRNA genes for organisms can 
be directly assessed by constructing strains without tRNA introns. 
As mentioned above, in archaea, one report demonstrated that 
the intron in the tRNA-TrpccA gene is dispensable for viability 
of H. volcanii in spite of its critical function in 2'-0-methylation 
of the tRNA (loardar et al, 2008). By expanding similar analy- 
ses in certain archaea, it may be tested whether every tRNA gene 
harboring an intron(s) on an archaeal genome needs to possess 
the intron(s). On the other hand, the situation is not so simple in 
eukaryotes. As mentioned previously, most of isodecoder tRNAs 
are encoded by degenerated genes with the same or very simi- 
lar sequences in eukaryotes. For example, even simple eukaryotes, 
such as yeasts, has more than 5 degenerated genes for each isode- 
coder tRNA on average. In S. cerevisiae, numbers of degenerated 
genes encoding intron-containing isodecoders varies from one 
for tRNA-SercGA to 10 for tRNA-LeucAA> tRNA-PhecAA, and 
tRNA-ProuGG (Lowe and Chan, 2011). Thus, only one isode- 
coder, tRNA-SercGA) was examined for dispensability of its intron 
in 1980s despite versatility of the yeast in chromosomal modi- 
fication through homologous recombination (Ho and Abelson, 
1988). Indeed, it was found that the intron in the tRNA-SercGA 
gene can be removed from the yeast chromosome without any 
apparent growth defects. However, the sequence of tRNA-SercGA 
is quite similar to that of tRNA-SeruGA; only three nucleotides 
are different between the two. Overproduction of tRNA-SeruGA 
can suppress lethality of deletion of the tRNA-SercGA gene, and 
this suppression requires ncm^U modification at U34 of tRNA- 
SeruGA to expand its recognition repertoire at the wobble position 
(Johansson and Bystrom, 2004; Johansson et al., 2008). These 
facts suggest functional redundancy between tRNA-SercGA and 
ncm^U-modified tRNA-SeruGA and that effects of intron removal 
from the tRNA-SerccA gene may be masked by this redundancy. 
A stricter test was done recently with tRNA-TrpccA> which is 



the only isodecoder to decode a single UGG codon for Trp and 
encoded by six genes with the same sequences by our group (Mori 
et al., 2011). Complete intron removal from all the tRNA-TrpccA 
genes conducted by repetitive replacement of the chromosomal 
tRNA genes with an intronless allele caused minimal effects on 
yeast growth under various conditions. In co-culture experiments 
with the wild-type strain up to 50 generations, the intronless 
strain even showed slight advantage over the wild type in growth. 
Furthermore, intron removal neither affects the amount and 
aminoacylation level of tRNA-TrpccA> nor the protein synthesis 
rate while minor changes in protein composition were detected 
in 2D-PAGE analysis (Mori et al., 2011). Because there exists 
no other tRNA to decode Trp codons on the yeast genome, the 
above results indicate that the intron of tRNA-TrpccA is dis- 
pensable for viability of S. cerevisiae. Therefore, not strict but 
rather subtle effects to have introns in genes of certain isodecoder 
tRNAs may be advantageous for organisms during evolutional 
time scale under the natural conditions. Or, intron insertion to 
and intron loss from tRNA genes are rather neutral for organisms 
in evolution. 

APPEARANCE OF INTRON-CONTAINING tRNA GENES AND 
ORIGIN OF INTRONS 

ARCHAEBACTERIAL INTRON: ESTABLISHMENT OF ARCHETYPAL 
INTRON-CONTAINING tRNA GENES AND MODERN 
INTRON-CONTAINING GENES 

There is a big debate on the origin of tRNA introns. One group 
argues for the "intron-first" scenario: all the primordial tRNA 
genes had an intervening sequence to be removed on the anti- 
codon loop or consisted of a set of two halves, and then a large 
part of such introns have lost from the tRNA genes or the two 
halves have been joined on the genome during evolution (Di 
Giulio, 1992, 2006). The other argues for the intron-late sce- 
nario that introns were inserted after primordial tRNA genes 
had been established (Cavalier- Smith, 1991). From the "simple- 
to-complex" view, the hypothesis that tRNA is derived from 
a tandem duplication of a short stem-loop or ligation of two 
fragments may be reasonable. Indeed, existence of split tRNAs, 
especially those create multiple tRNAs by combination of 5'- and 
3'-halves, fits to this idea (Di Giulio, 2006). Phylogenetic anal- 
ysis of archaeal tRNA genes also suggests that 5'- and 3' -halves 
of tRNA genes even without introns have evolved through dif- 
ferent evolutional trails (Fujishima et al., 2008). One and critical 
problem of this hypothesis is that functional units of a tRNA 
molecule, a decoding unit and an amino acid-accepting unit, do 
not correspond to these possible ancestral stem loops (Figure 1). 

On the other hand, comparison of closely related species sup- 
ports the idea that some "modern" introns have been inserted 
into tRNA genes afterwards. Many tRNA genes encoding differ- 
ent isodecoders in Thermoproteales genomes possess introns with 
very similar sequences at canonical and non-canonical positions 
despite the fact that the tRNA bodies are mapped at differ- 
ent phylogenetic positions (Fujishima et al., 2010). This finding 
suggests that large-scale intron transposition occurred in this 
order of archaea after the species had been established. Although 
the mechanism of intron transposition is still unknown, this 
is allowed because the BHB motif is required at exon-intron 
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junctions and may be able to specify possible transposition targets 
on the chromosome. Thus, this type of intron gain by archaeal 
tRNA genes is supposed to proceed in the DNA level. 

EUKARYOTIC INTRON: POSSIBILITY OF "RECENT " GAIN OF tRNA 
INTRON 

Eukaryotes are thought to have evolved from an ancestral 
prokaryote more closely related to archaebacteria than eubac- 
teria. So that, most of modern eukaryotic species might have 
lost archetypal intron-containing tRNA genes, which may be 
found in some of the archaeal genomes. Spectra of tRNA genes 
harboring an intron in eukaryotic genomes are not correlated 
with evolutional trails. Indeed, certain eukaryotic species pos- 
sess intron-containing genes for a certain isodecoder tRNA whUe 
other species in the same clade do not. For example, the yeast 
Lachancea thermotolerans in Saccharomycetaceae has 3 intron- 
containing tRNA-AlauGC genes and 3 intron-containing tRNA- 
LeucAG genes while other Saccharomycetaceae yeasts have only 
intron-less genes for these isodecoders (Figure 6A). From the 
sequencing analysis, the intron-containing tRNA-LeucAG genes 
seem to have emerged through codon switching by U34C muta- 
tion from tRNA-LeuuAG> which is mostly encoded by intron- 
containing genes in the Saccharomycetaceae yeasts. Thus, unique 
appearance of an intron on genes for a certain isodecoder tRNA 
does not necessarily mean intron gain of the tRNA genes dur- 
ing evolution. On the other hand, there is no tRNA paralogs 
encoded by intron-containing genes for tRNA-AlauGC in the 



Saccharomycetaceae yeasts. Mature parts of this isodecoder from 
L. thermotolerans are highly homologous to those of the same 
isodecoder from other related yeasts. Thus, tRNA-Alaucc genes 
in L. thermotolerans probably acquired their intron during evo- 
lution. Another unique piece of evidence for gain of a modern 
intron by tRNA genes is tRNA-Glyucc in Kluyveromyces lactis. In 
Saccharomycetaceae, only K. lactis has two tRNA-Glyucc genes 
with a eukaryotic intron. Surprisingly, the sequence of mature 
tRNA-Glyucc from K. lactis is less homologous to those of the 
same isodecoder from other yeasts, including those from the most 
related genera such as Lachancea and Eremothecium (Figure 6B), 
but more homologous to tRNA-Gluuuc from a eubacterium, 
Lactobacillus plantarum, where no protein-spliced intron exists in 
tRNA genes (Figure 6C). These situations strongly suggest that a 
bacterial tRNA-Gluuuc gene was horizontally transferred to the 
genome of K. lactis or its close ancestor and was converted into 
tRNA-Glyucc by U35C mutation. Then, the tRNA-Glyucc gene 
seems to have acquired an intron. Thus, some, if not all, of introns 
in eukaryotic tRNA genes are suggested to have modern origins. 
There are many opposite cases, in which only one species lacks the 
intron of a certain isodecoder tRNA while the same isodecoder of 
other related species is encoded by intron-containing genes (see 
tRNA-PhcGAA in Figure 6A), suggesting intron-loss from tRNA 
genes during evolution. 

Although it is easy to postulate ways of intron-loss from tRNA 
genes during evolution, how can a eukaryotic tRNA gene gain 
such a modern intron at the canonical position? Not like the 



A 


A 


G 


u 


c 




A 


P.e/AAAUUUUUU 


^mOIIIIIlI] 


raiMTTTTTISESaTTTTTl 


A 


G 


Pha/GAAnnninn 


Lau/GAG|||||n 


lle/GAU OOOOOC 


Val/GAC 000000 


U 


Leu/UAA mm 


Leu/UAG onnnnn 




Va./UAC IIIIIO 




C 


L.U/CAA 000001 


Lau/CAG 000001 




Va< CAC llllll 



Q^siiiiij^^niiiJiaiiiiL^aiiiii] 



Ser/GGA OQOOOO 




Tyr/GUA 000000 



^'pIIIIII 



CsGCAlUm 



■pIIIIII 



Trp/CCA OnOOOl 



'lit 11 mill 




111 11 11 11 11. 




-111111111111 



i 



mm 



mmi 



m 



ooooor °iy/'^'^'> 000000 



nooonr 



Giy Gcc mm 



Arg/CCG 



mil 



S. cerevisiae 
C. glabrata 
■ L. thermotolerans 
K. lactis 
£ gossypii 
hansenli 



MrTTTin^TTTTTl 



nonono'^-a^"'^" iiiiii ii i Qii 



IM Iilili. M lilili] 



mm 

JtRNA gene(s) without intron 
QtRNA gene(s) with intron 
Qno tRNA gene 
Jtermination condon 



S. cerevisiae 
C. glabrata 
L. thermotolerans 
K. lactis 
— E. gossypii 
D. hansenli 



yeast 
(eukaryote) 



Dha GGGCGGTTGGTGTAGTGGT . TAACATATTTGCCTTCCA AGCAGGTGACATGGGTTCGATTCCCGTACCGCTCA 

See GGGCGGTTAGTGrAGrGGr . TATCATCCCACCCTTCCA AGGTGGGGACACGGGTTCGATTCTCGTACCGCTCA 

Cg! GGGCGGTTAGTGTAGTGGT . TATCATTCCACCCTTCCA AGGTGGAGACACGGGTTCGATTCTCGTACCGCTCA 

Lth GGGCGGTTAGTGTAGTGGT . TATCATCCCACCCTTCCA AGGTGGGGACACGGGITCGATICTCGTACCGCTCA 

Ego GGGCGGTTAGTGTAGTGGT . TATCATTCCACCCTTCCA AGGTGGAGACACGGGTTCGATTCTCGTACCGCTCA 



K!a GACACTGTAGTCTAGTGGTTTAGGATACTCCCCTTCCAtaacaatgatcttcaggatcataggcgAGGGAGTGACACGGGTTCGAAICCCGTCAGTGTCA 

Lpl GACCCCTTGGTCAAGTGGTTTAAGACACTGCCCTTTCA CGGCAGIAACATGGGITCGAAICCCGTAGGGGTCA 

Cfe GGCCCCTTGGTCAAGTGGT . TAAGACACCACCCTTTCA CGGTGGTAACATGGGTTCGAATCCCGTAGGGGTCA 



FIGURE 6 I Distribution of intron-containing tRNA genes in yeast 
genomes. Distribution of intron-containing genes in 6 representative yeast 
genomes (Saccharomyces cerevisiae, Candida glabrata, Lachancea 
thermotolerans, Kluyveromyces lactis. Eremothecium gossypii, and 
Debaryomyces hansenli) is summarized in the anti-codon table (A). As 
illustrated in the right, existence of intron-containing (orange) or intronless 
(blue) tRNA genes for each anticodon position is color-coded. Anticodon 
positions to which no tRNA genes are assigned on the genome or those 
correspond to termination codons are shown in white and black, respectively. 



The phylogenetic tree of these yeasts shown in (B) was drawn according to 
Genolevures Consortium (2009). Among these yeast, D. hansenli does not 
belong to Saccharomycetaceae and is used as an outgroup. (C) Sequence 
comparison of tRNA-Glyucc from D. hansenli [Dha], S. cerevisiae (See), 
C. glabrata iCgi), L. thermotolerans {Lth), E. gossypii {Ego), and K. lactis {Kla), 
and tRNA-Gluuuc from Lactobacillus plantarum {Lpl) and Clostridium tetanus 
{Cte). Nucleotides identical to those of K. lactis tRNA-Glyucc (marked with a 
frame) are shown in blue, and anticodons are highlighted with orange 
shadow. 
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case of archaea, sequences of exon-intron junctions in eukaryotic 
pre-tRNAs do not show clear structural characteristics as men- 
tioned above. Therefore, it is difficult to imagine mechanisms 
by which a DNA fragment corresponding to a tRNA intron 
is transposed to another position on a chromosome. Another 
possibility is that intron is inserted into a tRNA gene via a reverse- 
transcribed intermediate. It is to be noted that the anticodon 
loop, into which eukaryotic tRNA introns are mostly inserted, is 
often the target of tRNA cleavage enzymes. To recognize a codon 
on mRNAs, the anticodon loop must be accessible to macro- 
molecules while the D- and T^C-loops interact with each other 
via base-pairing to form a tight structure. Thus, the anticodon 
loop is an Achilles' tendon for the tRNA to receive undesired 
cleavages. Well-known examples are so-called ribotoxins, such 
as colicin E5, and PrrC endonucleases produced by Escherichia 
coli, which cleave the anticodon of certain tRNAs to kill sur- 
rounding competitor bacteria and to kill itself when infected 
by a deadly pathogen, bacteriophage, respectively (Kaufmann, 
2000). Similar anticodon nucleases are also found in eukaryotes. 
y-Subunit of zymocin, produced by K. lactis, enters S. cerevisiae 
cells and cleaves tRNA-Gluuuc if U34 of the anticodon is modi- 
fied to mcm^s^U (Lu et al, 2005). Eukaryotic cells also have tRNA 
cleavage enzymes whose targets are tRNAs in their own cells, 
such as Rnyl in yeast and angiogenin in mammals (Thompson 
and Parker, 2009; Yamasaki et al., 2009). tRNA cleavage activi- 
ties of these endonucleases are regulated by cellular stresses, and 
the resulting tRNA fragments are used to suppress translation 
(Ivanov et al., 2011; Luhtala and Parker, 2012). Interestingly, the 
cleavage catalyzed by zymocin cannot be healed by tRNA lig- 
ase of S. cerevisiae, while heterologous expression of A. thaliana 
tRNA ligase can rescue S. cerevisiae from the deleterious effect 
of zymocin (Nandakumar et al., 2008). Probably, cleavage of 
the anticodon loop by these endonucleases is carried out so as 
to escape emerged RNA termini from endogenous tRNA ligase. 
Thus, the termini may provide initiation sites from which RNA- 
dependent RNA polymerase extends the 5'-tRNA halves with 
unidentified templates, and then eventually its 3' -terminus may 
be ligated with the 5'-terminus of the 3'-half by tRNA ligase if 
these ends are recognized by healing-sealing activity of tRNA lig- 
ase (Schwer et al., 2004; Nandakumar et al., 2008). Even though 
such cleavage, elongation and healing might occur in the cyto- 
plasm, the resulting tRNA derivatives might be re-imported into 
the nucleus using tRNA import machinery (Shaheen and Hopper, 
2005; Takano et al., 2005). Thus, the tRNA derivatives can be sub- 
jected to reverse transcription and integration into the genome 
in the nucleus. Or, if the cleavage occurs in the nucleus, reverse 
transcriptase may directly elongate the 5' fragment. 

Both loss and gain of tRNA introns seem to have occurred dur- 
ing various stages of eukaryotic evolution. In addition to such 
events, eukaryotic genomes require another layer of gene arrange- 
ments to settle their tRNA gene repertoires. AH the genes encoding 
a certain isodecoder tRNA on a eukaryotic genome tend to have 
an intron or they do not at all. Even in the cases that an isodecoder 
tRNA is encoded by a mixture of intron-containing and intron- 
less genes, those intron-containing and intron-less genes are 
supposed to have different origins. For example, human tRNA- 
LeucAA is encoded by total seven genes; one intron-containing 



gene on Chr I, four intron-containing genes on Chr VI, one 
intron-less gene on Chr I, and one intron-less gene on Chr XL 
Sequence comparison revealed that the intron-less gene on Chr I 
is more related to tRNA-Met in dog and mouse than the human 
intron-containing tRNA-LeucAA genes while that on Chr XI is 
more related to tRNA-LeuuAG in human and zebrafish. Therefore, 
sequences of tRNA genes with the same origin are supposed to 
be equalized including their intronic part. This is also confirmed 
among species. If sequences of tRNA-TrpccA genes are compared 
among fungal genomes, those from the same organism are clus- 
tered (Mori et al., 2011). Intergenic conversion between highly 
homologous tRNA genes for an isodecoder may contribute to 
this sequence equalization. Indeed, intergenic spreading of muta- 
tions among tRNA genes of the same isodecoder have been known 
for years as intergenic gene conversion hot spots (Amstutz et al., 
1985; Heyer et al, 1986). Although tRNA genes are scattered 
around the whole yeast chromosomes, they are closely located 
around the nucleolus in S. cerevisiae, and near centromeres 
in Schizosaccharomyces pombe, which has only three chromo- 
somes, and this intra-nuclear localization is driven by interaction 
between TFIIIC and condensin (Thompson et al., 2003; Haeusler 
et al., 2008; Iwasaki et al., 2010). Such spatial positioning of 
tRNA genes in the nucleus is supposed to help efficient equal- 
ization of tRNA sequences through intergenic gene conversion. 
We still do not know what determines balance between domi- 
nation of intron-containing genes and that of intron-less genes. 
As described before, there seems to be no essential difference in 
function of tRNA molecules derived from intron-containing and 
intron-less genes. It might be possible that domination of intron- 
containing or intron-less tRNA genes for an isodecoder tRNA just 
comes from stochastic change between the two states during the 
evolution: if the one of the two types occupies all the tRNA loci, 
no change becomes possible until a new and very rare event of 
intron-loss or intron-gain occurs. 

Here, I have gone through recent progresses related to tRNA 
introns found in both archaea and eukaryotes. The life of a 
tRNA molecule transcribed from intron-containing genes has 
been studied for decades from various points of views, and 
has provided various interesting findings in bioinformatics, in 
molecular biology, in cell biology, and even in human pathol- 
ogy. However, accumulating information from the experimental 
and computational analyses has brought us more questions to 
be solved, especially those related to eukaryotic tRNA introns 
and their splicing. Still, we do not understand completely how 
eukaryotic splicing endonuclease decodes hidden motifs of splice 
sites in eukaryotic pre-tRNAs. What kinds of substrates other 
than pre-tRNAs does splicing machinery process in archaea and 
eukaryotes? Why can tRNA introns be spliced in different intra- 
cellular compartments among eukaryotes? Are there any other 
physiological roles of tRNA introns in the life of tRNA or those 
as genetical elements on the chromosomes? What are real selec- 
tive pressures to keep introns in tRNA genes if existing? Most 
importantly, what are origins of modern tRNA introns, and how 
have they appeared and disappeared in archaeal and eukaryotic 
genomes during evolution? Some of the questions may not be able 
to be answered easily, but our quest of the life of tRNAs to seek the 
answers is going on. 
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